Generating Realistic Kanji Character Images from On-Line Patterns
نویسندگان
چکیده
The availability of a large sample database is very important to design high accuracy classifiers for handwritten character recognition. Collecting image samples from human writers and practical documents is expensive particularly for large character sets, like with East-Asia-languages. We can therefore take advantage of existing on-line databases to generate additional off-line images. This paper proposes a method to generate realistic character images from on-line patterns. From the pen trajectory of an on-line pattern, the proposed method can generate numerous images of various stroke shapes using three painting modes: constant line mode, proportional mode and calligraphic mode. Particularly, the calligraphic mode combines the pen trajectory (representing the writing style of one concrete writer) with real stroke images (also representing individual writing style of a concrete writer) to generate character images that look as if they were produced with brush or pen by
منابع مشابه
An Improved Approach to Generating Realistic Kanji Character Images from On-Line Characters and its Benefit to Off-Line Recognition Performance
This paper proposes a method for generating realistic calligraphic Kanji character images from on-line data. The proposed method is an improvement of our former method presented in [1]. Our new method can cope also with connected on-line strokes, i.e., stroke number variations, which were not correctly painted in our previous method. The new method decomposes strokes into three different parts ...
متن کاملModeling of Pen-Coordinate Information in SCPR-based HMM for On-line Recognition of Handwritten Japanese Characters
This paper describes stochastic modeling of pencoordinate information in HMMs with structured character pattern representation (SCPR) for on-line recognition of handwritten Japanese characters. SCPR allows HMMs for Kanji character patterns to share common subpatterns. Although SCPR-based HMMs have been successfully applied to Kanji character recognition, the pen-coordinate feature has not been ...
متن کاملRecent Results of Online Japanese Handwriting Recognition and Its Applications
This paper discusses online handwriting recognition of Japanese characters, a mixture of ideographic characters (Kanji) of Chinese origin, and the phonetic characters made from them. Most Kanji character patterns are composed of multiple subpatterns, called radicals, which are shared among many (sometimes hundreds of) Kanji character patterns. This is common in Oriental languages of Chinese ori...
متن کاملEffects of a Large Amount of Artificial Patterns for On-line Handwritten Japanese Character Recognition
This paper describes effects of a large amount of artificial patterns to train an on-line handwritten Japanese character recognizer. We need a huge amount of pattern samples to train recognizers to achieve high recognition performance for on-line handwritten character recognition. However, the existing pattern samples are not enough. We construct distortion models to generate a large amount of ...
متن کاملEvaluation of the SVM based Multi-Fonts Kanji Character Recognition Method for Early-Modern Japanese Printed Books
The national diet library in Japan provides a web based digital archive for early-modern printed books by image. To make better use of the digital archive, the book images should be converted to text data. In this paper, we evaluate the SVM based multi-fonts Kanji character recognition method for early-modern Japanese printed books. Using several sets of Kanji characters clipped from different ...
متن کامل